Coreference Resolution Evaluation Based on Descriptive Specificity

نویسندگان

  • François Trouilleux
  • Éric Gaussier
  • Gabriel G. Bès
  • Annie Zaenen
چکیده

This paper introduces a new evaluation method for the coreference resolution task. Considering that coreference resolution is a matter of linking expressions to discourse referents, we set our evaluation criteron in terms of an evaluation of the denotations assigned to the expressions. This criterion requires that the coreference chains identified in one annotation stand in a one-to-one correspondence with the coreference chains in the other. To determine this correspondence and with a view to keep closer to what human interpretation of the coreference chains would be, we take into account the fact that, in a coreference chain, some expressions are more specific to their referent than others. With this observation in mind, we measure the similarity between the chains in one annotation and the chains in the other, and then compute the optimal similarity between the two annotations. Evaluation then consists in checking whether the denotations assigned to the expressions are correct or not. New measures to analyse errors are also introduced. A comparison with other methods is given at the end of the paper. Identifying expressions which, in a text, denote the same discourse referent is usually considered a key process in automatic information extraction. However, the question of how to evaluate coreference resolution systems has sometimes been an issue: after the publication by Vilain et al. (1995) of a new evaluation scoring scheme for the Message Understanding Conferences, Popescu-Bellis et al. (1998) and Bagga et al. (1998) each proposed new evaluation methods. In this paper, we, in turn, propose a new evaluation method for the coreference resolution task. A coreference chain is defined by the property the expressions it contains have to denote a specific discourse referent (1). Our evaluation method so aims at evaluating coreference resolution with respect to this property, the problem being to evaluate whether the discourse referents associated with the expressions are the correct ones (2). From this setting of coreference resolution evaluation in terms of denotation assignment, one derives some constraints on the way two annotations should correspond; in particular, we observe that the fact that some expressions in a coreference chain are more specific to their referent than others has to be taken into account (3). The implementation of our evaluation method meets our requirements by computing the optimal similarity between the coreference chains in two annotations using a linear combination of Dice coefficients over some subsets of coreference chains (4). The recall and precision measures then express a comparison of two sets of denotation assignments. Three complementary measures for errors analysis are also proposed (5). Finally, we show how our evaluation method relates with existing ones (6).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Corpus based coreference resolution for Farsi text

"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...

متن کامل

Corefrence resolution with deep learning in the Persian Labnguage

Coreference resolution is an advanced issue in natural language processing. Nowadays, due to the extension of social networks, TV channels, news agencies, the Internet, etc. in human life, reading all the contents, analyzing them, and finding a relation between them require time and cost. In the present era, text analysis is performed using various natural language processing techniques, one ...

متن کامل

Coreference resolution: an empirical study based on SemEval-2010 shared Task 1

This paper presents an empirical evaluation of coreference resolution that covers several interrelated dimensions. The main goal is to complete the comparative analysis from the SemEval-2010 task on Coreference Resolution in Multiple Languages. To do so, the study restricts the number of languages and systems involved, but extends and deepens the analysis of the system outputs, including a more...

متن کامل

Coreference Resolution with Reconcile

Despite the existence of several noun phrase coreference resolution data sets as well as several formal evaluations on the task, it remains frustratingly difficult to compare results across different coreference resolution systems. This is due to the high cost of implementing a complete end-to-end coreference resolution system, which often forces researchers to substitute available gold-standar...

متن کامل

Coreference Resolution Strategies from an Application Perspective

As part of our TIPSTER III research program, we have continued our research into strategies to resolve coreferences within a free text document; this research was begun during our TIPSTER II research program. In the TIPSTER II Proceedings paper, "An Evaluation of Coreference Resolution Strategies for Acquiring Associated Information," the goal was to evaluate the contributions of various techni...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000